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1. Introduction 


Modelling protein structure seems a challenging enterprise because the number of structure 
parameters required ordinarily exceeds the amount of independent data points available 
from experimental observations. Expressing the predominant conformation of a protein in 
terms of a geometry model, a polypeptide chain consisting of N atoms would command 3N - 6 
Cartesian coordinates be fixed. Even for small proteins, this becomes a daunting number. 
Fortunately, so-called holonomic constraints limit the number of variables, leaving 
substantially fewer, truly relevant parameters for folding the polypeptide chain into its 
native tertiary structure. For example, adjusting bond lengths and the many angles between 
the covalent bonds connecting the atoms is of little concern and appropriate standard values 
can be inserted from tableworks (Pople & Gordon, 1967; Engh & Huber, 1991, 2006). Table 1 
exemplifies for the 147-residue protein Desulfovibrio vulgaris flavodoxin how the number of 
truly independent internal rotational degrees of freedom amounts to less than one-tenth of the 
Cartesian coordinate set size. 


IUPAC-IUB conventions (1970) define three mainchain torsions for the polypeptide 
backbone. Protein structure determination primarily seeks to adjust the values of those two 
mainchain torsion angles that chiefly determine the fold of the chain, that is, ọ and y in each 
amino-acid residue. Peptide-bond geometry, as described by mainchain torsion angle œ, is 
normally assumed to be trans- or cis-planar, fixed at 180° or 0° angles, respectively, and thus 
does not give rise to rotational variability. In a next instance, the sidechain torsion angle %1 is 
of interest. 


Nuclear magnetic resonance (NMR) spectroscopy is uniquely positioned to help determine 
internal orientational constraints in a molecule, be these atom-atom distances (Wiithrich, 
1986; Neuhaus & Williamson, 1989), relative bond orientations (Reif et al., 1997, 2000; 
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Schwalbe et al., 2001), relative protein domain orientations (Tjandra & Bax, 1997; Fischer et 
al., 1999) or rotational states of torsion angles (Pachler, 1963, 1964; Hansen et al., 1975; 
Bystrov, 1976; Ejchart, 1999). Especially powerful at measuring short-range interaction, 
NMR thus complements X-ray crystallography that is stronger at determining correlations 
over longer distances. 


Approach Parameters Items Coord’s 
top-down Cartesian coordinates (3N - 6) 6411 
less bond length constraints (N - 1) - 2138 4273 
less bond angle constraints 
- at tetrahedral centers (5 bond angles to fix) -417x5 2188 
— N-terminus (1) 
— mainchain C* (all residues) (147) 
— sidechain C? (all residues excl Gly) (129) 
— sidechain C’ (select residues) (85) 
— sidechain C? (select residues) (47) 
— sidechain C* (Met, Lys) and N° (Lys) (8) 
- at planar centers (3 bond angles to fix) -450x3 = 838 
— mainchain N’ and C’ (excl N- but incl C-terminus) (293) 
— sidechain nitrogen and carbon (157) 
- at angled centers (1 bond angle to fix) - 24x1 814 


— sidechain hydroxyl / thiol groups / sulfide bonds (24) 
less fixed or irrelevant torsion angles 


— mainchain o of peptide bonds -146 668 
— sidechain methyl - 84 584 
— sidechain amide / guanidinium groups - 26 558 
— sidechain hydroxyl / thiol groups - 24 534 
— N-/C-termini -2 532 


bottom-up Variable torsion angles of relevance 


— mainchain ġþ and y (2R - 2) 292 292 
— sidechain ¥; (all residues excl Gly, Ala) +112 404 
— sidechain y2 (select residues) + 84 488 
— sidechain y3 (select residues) + 30 518 
— sidechain y4 (Arg, Lys, Pro) +14 532 


Table 1. Coordinate statistics for D. vulgaris flavodoxin (R = 147 residues, N = 2139 atoms) 


We here focus on high-field NMR in aqueous solution which yields best resolution of the 
protein signals. Concentrated at around 1 mM, the dissolved protein is exposed to strong 
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fields of 11.7-23.4 T magnetic flux density, giving rise to nominal resonance frequencies in 
the proton band (1H) of 500-1000 MHz. Other biologically relevant stable isotopes 13C and 
ISN, respectively, resonate at roughly one-quarter and one-tenth of 'H at a given field. 


NMR experiments then consist of radio-frequency pulse trains that generate controlled 
transitions of spin states (Ernst et al., 1987). These transitions manifest in resonance lines 
whose spectrum location and detailed shape reveal information about the molecular 
environment in which the probed atoms are embedded. 


Rotatory states of torsion angles are accessible through measurement of so-called J-coupling 
constants which lend NMR signals a splitting fine structure due to bond-electron mediated 
pair interactions between NMR active nuclei. Put simply, the presence of each neighbouring 
(coupled) spin splits the resonance line of the observed spin into two. This splitting is field- 
independent (hence coupling ‘constant’) but does depend on local molecular geometry. 


Of particular interest to structure determination are three-bond °Jxy coupling constants as 
their magnitudes relate to dihedral angles subtended by the three covalent bonds 
connecting the coupled pair of nuclei X and Y, which can be any combination of 1H, !5N, or 
13C, prevalent in the topology of the torsion considered (Fig. 1). 


Fig. 1.9] spin-spin coupling topologies in %C,!°N enriched polypeptides. Arrows denote 
three-bond connectivities, grouped by conformational relevance to , y, and ¥, torsions. 


Multiple J data collected for a given torsion angle overdetermine that angle and thus allow its 
value to be retrieved. Six coupling constants, 3Januo Janc, 3Juncp, Jeno Jec, and %Jercp, 
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determine torsion angle (C’is-N’i-C*|-C’;) of residue i (Wang & Bax, 1996; Bliimel et al., 
1998). Another three couplings, *JHan’, 3Jcpn’, and 3Jnrn, would yield w(N’i-C%;-C’-N i+1). Of 
qualitative concern only, the planar trans- or cis-configured peptide bonds at torsion angle 
o(C%-C’j-N jn1-C%i41) can be verified by two couplings, JcgHn and 2Jcgcg, Which appear to 
reflect y geometry also (Hennig et al., 2000). Of numerous sidechain torsions encountered in 
amino acids, x1(N’;-C*%;-C";-C’) impacts most on the spatial orientation of the sidechain. Up 
to nine distinct coupling constants are accessible in amino acids (Pérez et al., 2001). 


Karplus (1963) suggested the dependence of 3J on dihedral angle 6 subtended by the three 
bonds that connect the coupled nuclei follow the empirical relation 


3](8) = Co + Ci cos 8 + C2 cos 20 (1) 


where Cnm are Karplus coefficients in Hz empirically calibrated for the 3J types encountered 
in amino acids (Bystrov, 1976; Wang & Bax, 1995, 1996; Hu & Bax, 1996, 1997; Schmidt et al., 
1999, Pérez et al., 2001). Multiples of 60° increments to 0 establish the phase relation between 
the actual internuclear dihedral angle 0 and the IUPAC-defined torsion $, y, or y1 (Fig. 2). 


J (Hz) 
N wo A o O N œœ (Co) 


-150 -120 -90 —60 —30 0 30 60 90 120 150 


Fig. 2. Angular dependence of the protein $-torsion related 3J coupling constants. 3J are at 
maximum when the bonds between the coupled nuclei are trans-oriented, and at minimum 
for perpendicular orientations. For °JHNH«, the panels show from left to right the internuclear 
angle @HNH« = > - 60° in the situations 180°, -90°, +0°, and +90°. 
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The coefficients Co signify mean J values obtained for a complete torsion-angle revolution, 
also referred to as conformation averaged J coupling constants. The differences (C2 - C1) are the 
largest deflections in J from the mean, where primary and secondary maxima of the curves at 
+180° and +0°, respectively, differ by 2C). 


Given sets of up to six 3J parameters per torsion, the challenge is now to find that angle 
value in best agreement with the experimental data. Numerical methods exploiting the 
redundance present in large data pools permit the self-consistent calibration of Karplus 
coefficients during the course of the angle refinement (Schmidt et al., 1999). This obviates the 
need for traditional referencing of conformations derived from X-ray data. 


We previously collected for flavodoxin values for the six possible coupling constants 3J(6) 
and determined the protein’s mainchain torsion angles (Schmidt et al., 1999). 
Discrepancy in @ between our NMR solution structure and comparison coordinates from 
X-ray crystallography (Walsh et al., 1998; Artali et al. 2002) is only 5° on average, which is 
smaller than the molecular dynamical angular libration due to thermal effects, indicating 
that both NMR solution and X-ray crystal structures of flavodoxin are very similar 
indeed. 


Here, we record 3J data for the enzyme Ribonuclease T1, a 104-residue protein (11 kDa) from 
Aspergillus oryzae (RNase T1, EC 3.1.27.3) that cleaves single-stranded RNA 3’-side of 
guanine nucleotides, and determine the majority of the ¢ torsion angles in the enzyme. 


2. Materials and methods 


One stringent requirement for efficient protein NMR analysis is that the protein sample be 
artifically enriched in the stable non-radioactive isotopes 1N and 8C, a process nowadays 
commonly applied in protein expression by recombinant technologies (Kainosho, 1997). 
Whilst oxygen does not play any role in protein NMR practice, the 1H isotope offering 
greatest sensitivity is ubiquitous and abundant. Sometimes, it is being depleted by 7H 
replacement in order to alleviate adverse signal relaxation effects occuring in large protein 
samples (above approximately 250 amino acids) exhibiting slow rotational tumbling rates 
(longer than approximately 10 ns rad-). 


2.1 Protein sample 


Uniformly 8C,!N-labeled RNase T1 (Lys25 isoenzyme) was obtained following established 
protocols (Quaas et al. 1988a,b; Spitzner et al., 2001) and used at 2-mM concentration in 
aqueous solution of pH 5.5 (containing 10% D20). All NMR spectra were recorded at 308 K. 
Prerequisite to any protein NMR analysis is the assignment of resonance signals to 
individual nuclei, not unlike a fingerprint of the molecule. Values quoted are chemical shifts 
in parts per million (ppm) from the respective 'H, C, or N band base frequency. 
Resonance assignments for RNase T1 (Fig. 3) are available from the BioMagRes database for 
1H (BMRB-133; Hoffmann & Riiterjans, 1988) and for 15N (BMRB-1658; Schmidt et al., 1991), 
and 13C chemical shifts were given by Pfeiffer et al. (1996b). Comparison 6 torsion angles 
were calculated from crystal coordinates, resolved at 0.15 nm, of RNase T1 complexed with 
Ca?* (PDB-9RNT; Martinez-Oyanedel et al., 1991). 


www.intechopen.com 


100 Protein Structure 


T T T T 
100} 4 
G88 
105} 8 4 
ae s 
=” G47 = 
G71 _ Ts: 
— ma 
110 SE 4 
oe” : © > a 
91 o o o o 
= Syo ° o o Soy 
£ V33 °- © o" o- assir o iss o N44 
& 1154 ká o oe Pe = os 4 
17 N84 
f s51 tan = a Be o 
a es 58 oe Be ia 
wr S etic ler, e OSS gS a 
S13 $35 gZ: 2 Resa e” 
{20l = w S $53 «989° “= Bis oo E | 
E s63 02 n v89 
S ORs pa SNI V16 > 
77 0 aI 
& Sis N98 oo 
S72 gg @ vas 80 Age 
Pe lad i ae 
125 Okun oh Me oe + 
yü oo: 
> P 
ya2 
D3 
v68 
D 
130 4 
g : 
li I i l | I i I | I i 
10.5 10 9.5 9 8.5 8 7.5 it 6.5 6 5.5 
F3 / HN (ppm) 


Fig. 3. 1H,!"N-chemical shift correlation spectrum of RNase T1 at 900 MHz and 308 K. 


2.2 Data collection 


The focus of this chapter lies on protein structure and interpretation of NMR data. The 
reader interested in the details of NMR pulse-sequence design and the setup of multi- 
dimensional NMR experiments is referred to specialist textbooks (Roberts, 1993; Evans, 
1995; Keeler, 2005; Hoch & Stern, 1996; Cavanagh et al., 2007) and to the literature including 
contained bibliography on the measurement of each of the six protein ọ related °J coupling 
constants, as given in the separate NMR Experiments section at the end of the chapter. 


Measurements of J coupling constants are subject to two principal concepts. One family of 
NMR experiments exploits frequency modulation, such as COSY, DQF-COSY, and E.COSY 
(Griesinger et al., 1987), giving rise to characteristic multiplet line splittings (Fig. 4). The 
other technique uses amplitude modulation, subdivided into J-modulation (Billeter et al., 1992) 
and J-correlation (Bax et al., 1994), encoding J in time-dependent signal intensity. Both 
approaches naturally have their pros and cons. While frequency-modulation schemes 
produce NMR signals of weaker intensity and larger space requirement, at times leading to 
signal overlap, amplitude-modulation schemes chiefly fail to measure small coupling 
constants. 


To minimize signal overlap, protein NMR spectra are often recorded in 3D mode by 
exploiting the resonance of an auxiliary spin and selecting 2D projections at that frequency. 
Example graphs in Fig. 4 show such an experimental multiplet pattern, together with its 
best-fit least-squares lineshape reconstruction and the residual error signal. 
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2.3 Data evaluation 


Whichever the chosen approach, the particular method applied to extract J values as 
structure parameters from NMR spectra must be considered an integral part of the analysis 
also. The authors gathered experience with both methods, albeit with an undeniable 
preference for E.;COSY-type spectra which perform particularly robust in connection with 
computer-assisted lineshape analysis (Schmidt, 1997a; Löhr et al., 2000). 


Contour plots of NMR signals for J analysis recorded in E.COSY mode show characteristic 
tilts (Griesinger et al., 1987), where a prominent large one-bond coupling spreads out the 
multiplet along the vertical spectrum dimension, so as to permit reading the sought smaller 
3] coupling off the frequency difference between the two multiplet halves in the horizontal 
dimension. 


In the example of Fig. 4, the 'Jcutt, of typically 143 Hz is exploited to split the signal into two 
halves along the F; dimension, given here by the 1°C* resonance frequency, and the small 
3JeHa coupling results from the frequency difference between both halves along the 
perpendicular F dimension, given here by the °C’ carbonyl resonance. 


The parameter record lists optimized values for coupling constants 'JcuH, and %9Jcr, as the 
primary E.COSY components responsible for the tilted appearance of the signal shape in 
vertical and horizontal directions, respectively. Other parameters include, apart from 
amplitude scaling, line widths and line asymmetries in both dimensions, a second 
unresolved splitting pair. 


RNase T1 - Asn44 @ F1=53.78, F2=175.38 ppm, F3=5.73 ppm 
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Fig. 4. Contour plot of a 500-MHz H(N)CA,CO[HA]-E.COSY multiplet recorded for RNase 
T1. Example analysis for extracting the °JcH,, coupling constant from the 2D BC% BC 
correlation signal (F,F2) taken at the resonance of the amide 'HN proton in the third 
dimension (F3). The three nuclei eliciting the three spectrum dimensions all couple with the 
1H* spin (Fig. 1.), whose presence, following the E.COSY principle, is seen only as splittings, 
not as another frequency dimension. 
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3. Results 
3.1 RNase T1 


Using a uniformly N and 13C stable-isotope labeled sample of the enzyme RNase T1, a 
total 512 3J values were collected, related to the polypeptide mainchain torsion angles in 82 
out of the total 104 amino-acid residues. 


Even without fitting quantitative torsion angles to the °J data, qualitative inspection of the J 
values permits insights into some details of the protein’s secondary structure already. For 
example, residue Asn44 in RNase T1 exhibits a very large 3Jena coupling of 6.67 Hz (Fig. 4), 
second only to that seen in Asn84. This is irreconcilable with a ọ torsion in the negative 
value range (Fig. 2). In addition, the 1Jcatta coupling of only 131 Hz (Fig. 4) falls well short of 
the expected average and supports a positive ¢ torsion, too. Consequently, Asn44 must 
exhibit a positive value for its } torsion. 


Signals for consecutive residues Asn43 and Asn44 in RNase T1 could hardly be more 
different. A value of 2.94 Hz for °Juncg in Asn43 (Fig. 5A) contrasts the lowly 0.47 Hz in 
Asn44 (Fig. 5B). Very different fcacg couplings also suggest differing backbone geometries. 
While the near-average 'Jcycx coupling of 35.8 Hz in Asn43 is common with negative p 
torsions, the unusually large 43.7-Hz coupling in Asn44 agrees better with a positive ọ value 
(Schmidt et al., 2009). Both being asparagines, this cannot be a residue-type specific effect on 
the J couplings (Schmidt, 2007a). 


The above-average value of 2.66 Hz seen for ’Jecp in Tyr45 (Fig. 5C) is consistent with a 
type-I B turn spanning both Tyr45 and Glu46. This tyrosine’s aromatic ring system is a 
critical component in nucleotide recognition and binding. 


The consecutive residues Ala87 and Gly88 in RNase T1 form a B bulge (Chan et al., 1993). 
An unusually small *Junc coupling near zero in Ala87 (Fig. 5D) and an unusually large 2.9- 
Hz Jec coupling in Gly88 (Fig. 5E) hint at torsion angles near +90°/-90° and 180° (Fig. 2), 
respectively, corroborating the distorted geometry in the central portion of a B strand. 


tit} || (deg) ~ AN3- / Witz) |_| NBE / | NS J NH) | tz), 
3JHNHa -60° 5.67 -0.71 3.37 9.74 3.63 
ee 180° 1.79 -0.75 1.56 4.09 0.64 
fancy 60° 2.32 -1.64 191 5.86 0.54 
ne 120° 3.21 -2.17 205 7.43 1.10 
es 0° 1.43 -0.96 0.76 3.15 0.58 
Tee -120° 1.33 -0.51 0.91 2.75 0.62 


Table 2. Karplus coefficients for Eq. 1, optimized against experimental ?J data for RNase T1 
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RNase T1 - Tyr45 @ F1=59.79, F2=177.47 ppm, F3=8.58 ppm 
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Fig. 5. Example 2D E.COSY multiplet sections from 3D NMR spectra recorded for RNase T1. 
Panels A-C (previous page): °Juncp and °Jccg evaluations exploiting the !Jcacp coupling of 
typically 35 Hz; Panels D-E: °Jpnc and 3Jcrc evaluations exploiting the 'Jcyc coupling of 
typically 53 Hz. Solid and dotted lines are positive and negative contours, respectively. 


Eventually, the 82 p torsion angles were fitted simultaneously to the pool of all 512 °J data. 
Self-consistently optimized in conjunction with the torsion-angle values, the Karplus 
curves shown in Fig. 6 and respective coefficients summarized in Table 2 represent the 
best fit to the °J data available for RNase T1 exclusively, yet, would be similar with other 


proteins also. 
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Fig. 6. Optimization of protein p torsions on the basis of °J coupling constants. XR panels: 
Experimental data for RNase T1 (dots) plotted against initial torsion values calculated from 
crystal structure coordinates (PDB-9RNT). SC panels: The same data plotted against torsion 
values iteratively optimized by referring exclusively and simultaneously to all °J data, 
adjusting both torsion angles and Karplus coefficients in a self-consistent manner (Schmidt 
et al., 1999). Arrows point at Asn44 data that appear to be outliers initially. 
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Fig. 7. Comparison of mainchain torsions þ in RNase T1 as inferred from self-consistent J 
coupling analysis and from crystallographic data (PDB-9RNT) by Martinez-Oyanedel et al. 
(1991). The majority of items agree within a tolerance of +30° (dashed bounds). In the final 
optimized set, the Asn44 torsion angle has flipped from -86° to +37°. 


X-ray 


Two facts rationalize the markedly reduced scatter seen in the right-hand-side 7JuyH, panel 
in Fig. 6 compared with the other five coupling types: Firstly, being larger than 3Juc or 3Jcc 
on an absolute scale, °Jr values dominate the fit and, secondly, p torsion values in regular 
protein structure typically fall on the steep slope of the Jmn curve, making regression 
sensitive to small changes in °JyH. The residual scatter is due to unaccounted substituent 
effects in the different amino-acid types (Schmidt, 2007a), unaccounted torsion-angle 
dynamics (Briischweiler & Case, 1994; Pérez et al., 2001), and random experimental error. 


Assigned to Asn44 in RNase T1, the solitary data point in the left-hand-side °Jc1,() panel in 
Fig. 6, near the upper border, is elevated by around 3 Hz above the crowd. Apparently, 
other large °JcH, coupling constants around 6-7 Hz connect with positive $ torsion angles. 
Contrasting the negative þ value found for Asn44 in crystal structure PDB-9RNT (Martinez- 
Oyanedel et al., 1991), the conformation in solution that emerged from our J coupling 
analysis differs from that in the solid state (Fig. 7). Similarity between relative J coupling 
values for Asn44 and those residues that exhibit positive @ angles clearly suggest that 44 be 
rotated to a positive value also. Indeed, self-consistent optimization of  torsions referencing 
the set of six o-related coupling types converges at +37° for Asn44, while improving 
dramatically the error between observed coupling constants and those predicted from the 
crystal-structure angles. 


The region around residues 42-48 of the enzyme engages in the recognition and binding of 
the substrate nucleotide (Fig. 8). Inspection of eight crystal structures of RNase T1 in a 
variety of complexes reveals that Asn44 adopts a negative ¢ value in only those structures 
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presenting a free binding site or, rather, a calcium, zinc, or vanadate metal ion bound to the 
apo-enzyme (9RNT, 8RNT, 3RNT). Similarly, the inadequate adenosine nucleotide complex 
(6RNT) also resembles the apo-enzyme. Even though the distant ribose phosphate interacts 
with catalytic residues His40 and Glu58 in this latter structure, the nucleobase is directed 
away from the enzyme, so as to leave the recognition loop unoccupied, for RNase T1 
binding is G not A specific. 


However, four other crystal structures of RNase T1 in various complexes, notably those with 
a bound guanine nucleotide (IRLS, 1RNT, 2RNT, 5RNT), do exhibit positive ọ angles for 
Asn44 (Fig. 7). Also b angles in the adjacent positions Asn43 and Tyr45 differ between the 
two groups, by approximately 40° and 10°, respectively. 


The observations suggest that binding of the correct substrate elicits a concerted 
conformation change involving torsion ¢44, and likely was, too, as well as torsions wa3 and ¢as 
in the preceding and subsequent residue, respectively. Yet, the NMR evidence tells a 
different story: Torsion angle $44 is positive already in our sample of the free enzyme in 
aqueous solution that was subjected to NMR measurement! Conformational variability of 
the Asn44 backbone, such as continual flips between positive and negative values, can be 
ruled out on the basis of the large °Jc1,, coupling value observed, as such conformational 
averaging would reduce the J value towards its mean of about half the size (Table 2). 


A likely explanation for the negative angle value in some of the crystal structures would be 
packing effects through molecules in adjacent grid cells distorting the conformation of the 
rather exposed 42-48 loop region at the protein surface. Another possible explanation could 
be that the metal agents added to aid the crystallization process obstruct the binding site 
differently than the natural substrate would do. 


Fig. 8. Stereo view of residues 42-48 in the crystal structure of the RNase T1:2’GMP inhibitor 
complex (PDB-1RNT, Arni et al., 1988). Involved in nucleotide recognition and binding, 
Asn43 and Asn44 backbone amide groups interact with the guanine five- and six-ring, 
respectively, while the Glu46 sidechain carboxylate probes the presence of the correct 
hydrogen bonding capacity at the Watson-Crick edge of the guanine. Tyr42 and Tyr45 
sandwich the guanine ring plane. 


3.2 Flavodoxin 


3] coupling data from a similar study targeting backbone torsion angles ų in flavodoxin 
suggested subtle differences between the structures of this protein in aqueous solution 
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(NMR) and in the crystalline solid state (X-ray). For example, on the basis of J coupling 
constants the orientation of peptide plane Ala39/Ser40 shown in the X-ray structure needed 
to be adjusted by about +10° in order to fulfil the experimental data (Löhr et al., 2001). 


A conformational hot spot is found in flavodoxin’s so-called “60-loop”, the region protruding 
around the isoalloxazine ring system, made up of Gly61-Asp62-Asp63-Ser64. Results from our 
analysis on the exclusive basis of J-coupling constants agree with what has been known for 
some time already from X-ray studies on flavodoxin (Watenpaugh et al., 1976). These crystal 
structures of flavodoxin can be grouped according to 2FX2, 3FX2, 1J8Q, and ‘WFX2’1, on one 
side, and 4FX2, 5FX2, 1BU5A, and 1BU5B, on the other. The former group represents oxidized 
species of the FMN cofactor, the latter includes reduced (5FX2) and semi-reduced (4FX2) 
quinone forms. 1BU5A and 1BU5B (Walsh et al., 1998) are two chains in an asymmetric unit of 
apo-flavodoxin-riboflavin complex lacking the ribose phosphate. 


Conformations of torsions we: and 62 in flavodoxin differ by almost 180° between both 
structure sets, implying a flip of the intervening peptide plane with connected changes in 
the hydrogen bonding network. Our NMR data on the oxidized species support the torsion 
angles derived from the first group. Most notably, Asp62 adopts a rare positive b torsion 
angle responsible for an elevated ĉ°Jcna coupling constant of 5.1 Hz. As it is also connected 
with an unusual negative y torsion, this data point stands out in a graph of 3Jcu.() (not 
shown). Albeit a non-glycine residue, Asp62 exhibits a secondary-structure feature normally 
characteristic of glycine as frequently found in a type-II’ P turn motif (Creighton, 1993). But 
then, the surrounding torsion angles are too distorted to form a proper reverse turn. 
Variability in this loop region is also corroborated by an analysis of coupling constants 
related to the sidechain y; torsion angle (Schmidt, 2007b). 


4. Discussion, conclusion and scope 


Complete determination of a protein fold from scratch is inherently impossible on the sole 
basis of J coupling restraints alone, owing to their short-range nature of interaction. Even 
though ọ torsion angles can be determined fairly accurately, and values of y can at least be 
narrowed to plausible ranges, while fixing  torsions at the planar 180° value, the chaining 
of the distinct amino-acid fragments is likely to cause errors to accumulate in the process. A 
few long-range restraints, such as those provided by the measurement of NOE effects, will 
normally be required to ensure the correct fold over the whole polypeptide chain. In fact, 
the present results derived from our J-coupling data are consistent with an independent 
previous investigation into the RNase T1 solution structure by means of the traditional 
measurement of NOE effects that were subsequently converted into proton-proton distance 
constraints (Pfeiffer et al., 1996a). 


4.1 Data correlation and redundance 


The ọ torsion angle in each amino-acid residue is supported by up to six °J coupling 
constants, which, in aggregate, determine just one internal rotational degree of freedom of 
the molecular model, the value of ọọ. Even though the J data are all independent 


1 “WFX? signifies a preliminary X-ray coordinate set of D. vulgaris flavodoxin not available from the 
PDB and kindly provided by Martin Walsh, University College Galway, Ireland. 
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observations, they do not represent six independent structure parameters. Owing to their 
simultaneous dependence on the same torsion, their values are highly correlated, or anti- 
correlated, for that matter (Table 3). Somewhat limited by pairs of couplings exhibiting 
correlated angle dependence (Fig 2), the amount of independent structure information 
available reduces to effectively 3 or 4, rather than 6, yet still exceeds the single rotational 
degree sought to be fixed. Precisely this redundance, however, is key to determining torsion 
angles accurately. 


SJHNHa SJENC’ 3JHNCp lease Jee 3Jecp 
TA 6.77+2.12 -082 -1.20 1.18 0.81 dds 
Stic -44% 0.89+0.74 -0.11 0.27 0.27 -0.19 
Jaa -79% -2% 1.62+0.85 -0.82 -0.62 0.77 
ii 54% 8% -61% 2.32+1.29 0.47 -0.84 
Jor 58% 16% -76% 30% 0.97 +0.58 -0.59 
J -79% -6% 89% -69% -77% 1.83 + 0.78 


Table 3.Variance-covariance matrix for 512 -related °J coupling constants in RNase T12 


4.2 Molecular dynamics effects 


At times, the excess information contained in a set of observables can give insights into 
dynamic effects that may prevail in the molecular structure (Schmidt, 1997b). 
Conformational dynamics frequently complicate the analysis of amino-acid sidechain 
torsion angles yı and need be taken into account for satisfactory interpretation of 
experimental J data related to that angle type. A variety of angular-mobility models can be 
applied to cases in which a single fixed torsion does not explain the observed data 
satisfactorily. Analyses commonly assume the x, torsion either to dwell preferentially in 
energetically favourable staggered states (Pachler, 1963, 1964; Hansen et al., 1975) or to 
librate about a mean value according to a Gaussian probability profile (Jardetzky, 1980; 
Karimi-Nejad et al., 1994; Briischweiler & Case, 1994). 


In the first approach, amino-acid sidechain torsions are analyzed assuming the presence of 
interconverting staggered-rotamer conformations of yı = -60°, +180° and +60° to solve for 
the respective populations, pı, p2, and ps, by linear combinations of so-called trans and gauche 
coupling values (Table 2). The Gaussian model, however, typically limits the torsion to one 
predominant conformation, yet, allowing for larger angular variability, at times. To this end, 
angular standard deviations of approximately 60° can be considered to represent a fully 
revolving torsion. Both models are somewhat complementary, the staggered-rotamer one 
being easier to apply, whereas the Gaussian one usually fits the data better (Schmidt, 1997b). 


2 Diagonal: mean and standard deviation (square-root of variance, in Hz). Upper triangle: standard 
deviation attributed to joint variation in both J types (square-root of covariance, in Hz), negative signs 
indicating antivariance. Lower triangle: pair-correlation coefficients. 
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4.3 ‘J and 7J couplings in protein structure analysis 


Interest is arising in short-range !J and ?J coupling constants (Wienk et al., 2003; Löhr et al., 
2011) as these are comparatively easier to measure than %J, albeit less well understood 
regarding their conformational dependence (Schmidt et al., 2009, 2010). However, by simply 
recognizing qualitative classes of small, medium, and large magnitudes for the respective 
parameters, it was feasible to pinpoint the boundaries of secondary-structure elements in a 
protein (Schmidt et al., 2011). The suggested procedure of J-indexing also exploits data 
redundance as each torsion angle is surrounded by large numbers of !J and ?J coupling 
constants (Fig. 9). 


Fig. 9. The ten types of *J coupling constants encountered in the protein backbone. 


Protein 'J-, ?J-, and %J-coupling constant data related to one-, two-, and three-bond 
interaction, respectively, continue being deposited by NMR spectroscopists with the 
Biological Magnetic Resonance Data Bank (BMRB, Ulrich et al., 2008) (Fig. 10). 


4.4 Accuracy issues 


Quantitative %J-coupling analysis in proteins appears to have reached a level of detail and 
accuracy at which a change of a few degrees in a torsion angle, comparable to thermal 
librational amplitudes, makes a noticeable difference, allowing, for example, genuine 
differences between NMR-based solution and X-ray based crystal structures to be detected. 


The process of defining useful conformational constraints on the basis of J coupling 
constants usually requires, first, extracting accurate values for the spin-system related 
property J from NMR spectra and, second, translating these values into molecular-geometry 
related dihedral-angle values within the framework of a specified model of molecular 
structure and possibly dynamics. Naturally, both these stages come with their inherent 
inaccuracies which will impact on the final result (Fig. 11). 
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Fig. 10. Cumulative data volume submitted to the BMRB in category protein J coupling 
constants. The current number of deposited sets is around 350, totaling more than 25,000 
values, the majority of which are °Junna coupling constants. Recent leaps reflect large-scale 
depositions of hitherto less popular ‘J and ?J parameters which the authors of this chapter 
have set out to explore regarding utility in protein structure determination (Schmidt et al., 
2009, 2010). 
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Fig. 11. Progression of procedural errors and random uncertainties into molecular structure. 
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A word of caution is advised though. Even if both above mentioned evaluation stages as 
well as the recording of NMR spectra were void of systematic error, i.e., not limited by their 
technical approach, molecules don’t always do us the favour of revealing an unambiguous 
structure. If conformational dynamics are at play, such ambiguity is not to be confused with 
lack of accuracy or precision, and the geometry of the molecule must not be squeezed into a 
‘strait-jacket’, as it were. Rather we need to develop our understanding of molecular 
structure, if not re-define the term, in a way that integrates both dynamic as well as static 
aspects of conformation. Research into partially or intrinsically disordered protein 
structures is witness to such development (Dyson & Wright, 1998). 


5. NMR Experiments 


J coupling measurements were carried out on NMR spectrometers manufactured by Bruker, 
Rheinstetten, Germany, with nominal fields ranging from 500 to 900 MHz proton frequency. 
All instruments were fitted with cryogenically cooled triple-resonance z-axis pulsed-field- 
gradient probes, except the 500-MHz instruments were equipped with room-temperature 
triple-resonance three-axis pulsed-field-gradient probes. 


Experiment names of the exclusive correlation spectroscopy (E.COSY) follow a convention 
that identifies by uppercase lettering those protein nuclei that are active in generating the 
spectrum dimensions, in round parentheses those used for magnetization relay only, and in 
square brackets the presence of those passive spins that give rise to the sought J coupling 
interaction (Wang & Bax, 1995). 


Each coupling constant was measured at least twice, and mean values quoted carry two 
standard deviations, the former denoting the intra-spectrum, i.e., variation across all amino- 
acid residues, and the latter denoting the standard error or inter-spectrum rms variation 
between the repeat experiments, i.e., a measure of reproducibility. 


5.1 3J(HN,H®) coupling constants 


3JunHa coupling constants originated from 3D-HA[HB,HN](CACO)NH quantitative 
J correlation spectra recorded at 500 MHz (Löhr et al., 1999). The coupling constant for 
residue i was evaluated in spectrum planes at the resonance N; by summing the data 
points comprising the HN;,HN;+ı cross peak and referring its intensity to the H*,HN;+ı auto 
peak according to Icoss/Iauto = -tan?(zJA), with the magnetization-transfer delay, A, set at 
17 ms. Raw coupling constants were corrected for H/D exchange by an average factor of 
1.049, due to scaling all Icoss/Iauto ratios for the 10-% DO solvent component needed for 
field-frequency locking. Both datasets yielded 77 values of corrected łJanua averaging 6.77 + 
2.17 + 0.24 Hz. 


5.2 3(HN,C’) coupling constants 


3Junc' coupling constants were determined by least-squares fitting 2D projections of 
E.COSY-type C*,H™ multiplets (Schmidt et al., 1997a) recorded at 900 MHz using a [15N,1H]- 
TROSY variant (Pervushin et al., 2000) of 3D-(H)CANNH[CO] spectra (Lohr & Riiterjans, 
1995). To minimize transverse relaxation effects during the constant-time period, C* 
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chemical shifts and 1Jcac couplings were simultaneously evolved in a 1H*,SC* multiple- 
quantum constant-time t period while spin-locking 'H" magnetization using a 3-kHz spin- 
lock field. Insufficient to cover the complete H” shift range uniformly, poorly decoupled C* 
resonances near the excitation band edges were omitted from evaluation. Glycine patterns 
were disregarded entirely as the NMR experiment is not optimized for their twin-H® spin 
topology. The highly reproducible results comprised 70 values of ?Junc averaging 0.75 + 0.71 
+ 0.06 Hz. For many of those residues that needed to be ignored in (H)CANNH evaluations 
for reasons of lineshape distortion or overlap, 3Junc coupling constants were obtained using 
a simpler 3D-heteronuclear relayed-E.COSY approach at 500 MHz in combination with 
more sophisticated antiphase 2D-multiplet line-shape analysis (Schmidt et al., 1996). The 76 
values averaged 0.86 + 0.72 + 0.17 Hz, slightly larger than those from (H)CANNH. 
Additional values were extracted from a [!5N,!H]-TROSY version of ct-HNCA[CO]-E.COSY 
spectra (Wang & Bax, 1995) recorded at 600 MHz, averaging 0.81 + 0.79 + 0.21 Hz and 
bringing the total number of items to 92, including data for glycine. Rmsd between all six 
spectra was 0.22 Hz. 


5.3 3(HN,C*) coupling constants 


3Juncg coupling constants were determined by simulating 2D projections of C*,H™ multiplets 
resulting from a [!°N,1H]-TROSY version of the 3D-HNCA[CB]-E.COSY experiment (Wang 
& Bax, 1996) carried out at 800 MHz. Glycines generally lacking the C? nucleus were 
ignored. Overlapping C*; and C°j-1 resonances prevented a few coupling constants from 
being obtained. Coincident C* and C? chemical-shift ranges in serine and threonine residues 
precluded selective excitation of either nucleus and respective data were disregarded. Thus, 
the two recordings yielded only 58 values, averaging 1.50 + 0.94 + 0.09 Hz. Additional ?Jrncp 
coupling constants resulted however from 3D [!5N,‘H]-TROSY-HNCB quantitative J 
correlation experiments (F. Lohr, unpublished) performed at 600 and 800 MHz. Dephasing 
of 1HN coherence due to passive 9JHNH, couplings during J evolution delay A, which was set 
at either 40 or 45 ms, was avoided by employing BIRD refocusing elements (Garbow et al., 
1982) in the centre of the A period. Three spectra yielded 68 values of 3Juncp averaging 1.71 + 
0.79 + 0.18 Hz between quantitative J correlation data. Between all five sets, 81 values were 
determined at 1.62 + 0.85 + 0.14 Hz. 


5.4 3J(C'i-1,H%;) coupling constants 


3JeHa coupling constants resulted from fitting 2D projections of C%;,C';-ı1 multiplets in 3D- 
H(N)CA,CO[HA]-E.COSY spectra (Löhr & Rüterjans, 1997; Löhr et al., 1997) recorded at 500 
MHz. The total 81 values collected averaged 2.33 + 1.29 + 0.16 Hz. 


5.5 3J(C'i1,C’;) coupling constants 


3Jec coupling constants resulted from fitting 2D projections of C%,C';-ı multiplets in 3D- 
H(N)CA,CO[CO]-E.COSY spectra (Löhr et al., 1997) recorded at 500 MHz. Glycine 
couplings included, the two datasets comprised 90 3Jec values averaging 0.84 + 0.65 + 0.16 
Hz. Additional Jee coupling constants resulted from quantitative J-correlation experiments 
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using a [!5N,!H]-TROSY version of the 3D HN(CO)CO pulse sequence (Hu & Bax, 1996). 
Two spectra were acquired at 500 MHz with the J-evolution period A set at either 50 or 60 
ms. The spectra allowed both sequential coupling constants 3](C’i-2,C’i1) and 3J(C’is,C’;) to 
be evaluated, yielding 78 values averaging 1.03 + 0.61 + 0.23 Hz. The final set comprised 98 
coupling constants of 0.95 + 0.63 + 0.26 Hz. 


5.6 3(C'-1,C°)) coupling constants 


3Jecp coupling constants were measured at 500 MHz by H(N)CA,CO[CB]-E.COSY (Löhr et 
al., 1997), yielding only 59 values averaging 1.96 + 0.86 + 0.16 Hz. As with Jance, 
determining °Jccg coupling constants with these E.COSY-type experiments fails for serine, 
threonine, and some leucine residues, all exhibiting downfield-shifted 13C! resonances, so as 
to overlap with C* chemical-shift ranges, preventing selective excitation. In contrast, 
HN(CO)CB quantitative J correlation (Hu & Bax, 1997) provides values for all residue types 
as long as the active coupling magnitude exceeds a certain threshold determined primarily 
by the signal-to-noise ratio. Spectra were acquired at 500 MHz with delays A set at either 
37.5 or 38.5 ms to match 2/1J(C',C’). The !8C carrier frequency was positioned at either 26 or 
60 ppm. Quantitative J correlation yielded 74 data, averaging 1.83 + 0.75 + 0.10 Hz. 
Additional *Jccg values including those in Ser and Thr residues were obtained from C%,C';-1 
multiplet projections in H(N)CO,CA[CA]-E.COSY recorded at 800 MHz (F. Lohr, 
unpublished). The 76 values averaged 1.66 + 0.80 + 0.20 Hz, and the grand average over all 
ten sets was 1.82 + 0.78 + 0.20 Hz, totaling 82 values. 


6. Supplementary material 


3] coupling constants related to the $-torsion angles in RNase T1 as determined in the 
present work are deposited with the BioMagRes Database (accession number BMRB-16469), 
available at http://www.bmrb.wisc.edu/cgi-bin/explore.cgi? format=raw &bmrbId=16469 
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